Robust clusterwise linear regression through trimming

نویسندگان

  • Luis Angel García-Escudero
  • Alfonso Gordaliza
  • Agustín Mayo-Iscar
  • R. San Martín
چکیده

The presence of clusters in a data set is sometimes due to the existence of certain relations among the measured variables which vary depending on some hidden factors. In these cases, observations could be grouped in a natural way around linear and nonlinear structures and, thus, the problem of doing robust clustering around linear affine subspaces has recently been tackled through the minimization of a trimmed sum of orthogonal residuals. This ‘‘orthogonal approach’’ implies that there is no privileged variable playing the role of response variable or output. However, there are problems where clearly one variable is wanted to be explained in terms of the other ones and the use of vertical residuals from classical linear regression seems to bemore advisable. The so-called TCLUST methodology is extended to perform robust clusterwise linear regression and a feasible algorithm for the practical implementation is proposed. The algorithm includes a ‘‘second trimming’’ step aimed to diminishing the effect of leverage points. © 2009 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fuzzy clusterwise linear regression analysis with symmetrical fuzzy output variable

The traditional regression analysis is usually applied to homogeneous observations. However, there are several real situations where the observations are not homogeneous. In these cases, by utilizing the traditional regression, we have a loss of performance in fitting terms. Then, for improving the goodness of fit, it is more suitable to apply the so-called clusterwise regression analysis. The ...

متن کامل

Regularized fuzzy clusterwise ridge regression

Fuzzy clusterwise regression has been a useful method for investigating cluster-level heterogeneity of observations based on linear regression. This method integrates fuzzy clustering and ordinary least-squares regression, thereby enabling to estimate regression coefficients for each cluster and fuzzy cluster memberships of observations simultaneously. In practice, however, fuzzy clusterwise re...

متن کامل

Clusterwise PLS regression on a stochastic process

In this paper we propose to use the PLS approach for clusterwise linear regression in the particular case where the set of predictor variables forms a L2-continuous stochastic process {Xt}t∈[0,T ]. We have adapted the k-means algorithm to this case and we give necessar conditions for its convergence. The results of an application of the clusterwise PLS regression to stock-exchange data are comp...

متن کامل

Robust nonparametric kernel regression estimator

In robust nonparametric kernel regression context,weprescribemethod to select trimming parameter and bandwidth. Through solving estimating equations, we control outlier effect through combining weighting and trimming. We show asymptotic consistency, establish bias, variance properties and derive asymptotics. © 2016 Elsevier B.V. All rights reserved.

متن کامل

PCR and PLS for Clusterwise Regression on Functional Data

Clusterwise regression is applied to functional data, using PCR and PLS as regularization methods for the functional linear regression model. We compare these two approaches on simulated data as well as on stock-exchange data.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 54  شماره 

صفحات  -

تاریخ انتشار 2010